Draft version December 6, 2010 

Preprint typeset using L£-T^X style emulatcapj v. 11/10/09 



O 

(N 

O 

Q 



O 

U 

6 
to 

> 
O 

in 
rn 

od 
O 

o 



X 



INCREASING THE FISHER INFORMATION CONTENT IN THE MATTER POWER SPECTRUM BY 

NON-LINEAR WAVELET WIENER FILTERING 

Tong-Jie Zhang 1,2,3 , Hao-Ran Yu 1,3 , Joachim Harnois-Deraps 3,4 , Ilana MacDonald 3,5 and Ue-Li Pen 3 

department of Astronomy, Beijing Normal University, Beijing, 100875, P. R. China 
2 Center for High Energy Physics, Peking University, Beijing, 100871, P. R. China 
3 Canadian Institute for Theoretical Astrophysics, University of Toronto, M5S 3H8, Ontario, Canada 
4 Department of Physics, University of Toronto, M5S 1A7, Ontario, Canada 
5 Department of Astronomy and Astrophysics, University of Toronto, M5S 3H8, Ontario, Canada 

Draft version December 6, 2010 

Abstract 

We develop a purely mathematical tool to recover some of the information lost in the non-linear 
collapse of large-scale structure. From a set of 141 simulations of dark matter density fields, we 
construct a non-linear Wiener filter in order to separate Gaussian and non-Gaussian structure in 
wavelet space. We find that the non-Gaussian power is dominant at smaller scales, as expected from 
the theory of structure formation, while the Gaussian counterpart is damped by an order of magnitude 
on small scales. We find that it is possible to increase the Fisher information by a factor of three before 
reaching the translinear plateau, an effect comparable to other techniques like the linear reconstruction 
of the density field. 

Subject headings: cosmology: theory — dark matter — large-scale structure of universe — methods: sta- 
tistical 



1. INTRODUCTION 

In modern cosmology, the power spectrum of mat- 
ter fluctuations, P(k), is a key measurement, as it 
is related to many cosmological parameters. Re- 
cent obs ervations such as the cosmic microwave back- 
ground (jKomatsu et al.l I2009T) . the correlation func- 
tion of galaxies (Eisenstein et all 120 05: Tcgmar k et al.1 
120061 iPercival et al.1 l2007bl )al. 120101 ) . the strong gravi- 
tatio n al lensing probability for image s eparation (jChenl 
[2001 IChen fc Zhaol [20061: lOguril l2006n . and the shear 
correlation func t ion for weak gravi t ational lensing 
dPen et al.l 120031: iHoekstra et al.l 120061: iBeniamin et al.1 
120071 IFu et al J 120081 ) . have been able to but constraints 
on many parameters. In many analyses, cosmologists are 
interested in measuring the matter power spectrum on 
large scales, which holds a wealth of cosmological infor- 
mation that has not been erased by the non-linear grav- 
itational collapse. The constraining strength of a survey 
depends directly on the amount o f Fisher information 
(|Fisherlll935l iTegmark et all 119971 ). i.e. statistically in- 
dependent Fourier modes, contained in the measurement, 
and one needs to maximize the information in order to 
minimize the uncerta in ty on c osmol ogical parameters. 

iRimes fc Hamilton! (|2005l I2006D first studied the 
amount of Fisher information as a function of scale con- 
tained in the matter power spectrum. They estimated 
the information content about the initial amplitude of 
the linear power spectrum from an ensemble of many N- 
body simulations and showed that the Fisher information 
is preserved in the matter power spectrum on large, linear 
scales. They also observed that the information plateaus 
in the translinear regime (fc ~ 0.2 — 0.8/i/Mpc), and that 
there is a second increase in in formation when i nclud- 
ing the small, non-linear scales. iNevrinck et al.l (2006) 
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then found that the halo model also predicts this be- 
havior on translinear scales in the information content, 
which comes largely from cosmic variance in the num- 
ber o f the largest haloes i n a g iven volume. Further- 
more, [NeyrincF&ISzacud! (|2007l ) found that the cosmo- 
logical Fisher information in the matter power spectrum 
about many key cosmological parameters is quite degen- 
erate both on translinear and non-linear scales. They 
also showed that it can be difficult to constrain the ini- 
tial cosmological conditions from the dark matter power 
spectrum on non-linear scales, and suggested the removal 
of the largest haloes as an alterna tive to resolve this 
roblem. B as ed on the t e chniq ue of IRimes fc Hamilton! 
20051 12006D . iLee fc Pen! (|2008l) estimate the Fisher in- 
formation in the galaxy angular power spectrum from 
the photometric redshift catalogue of the Sloan Digital 
Sky Survey Data Release 5. Their results also show that 
very little new information is contained in the matter 
density power spectrum on the translinear scale, which 
is observ ationally consistent with the previous numeri- 
cal tr end (IRimes fc Hamilton 2005, 2006; Hamilton et al.l 
[2001 INevrinck et alJl2006t INevrinck fc Szapudil 120071 ) . 

Many methods have been proposed to recover parts 
of this lost information, notabl y running TV-body 
simul ations backwards in time ([Goldberg fc Spergell 
2000), or density field r e constr u ction from lin- 
ear theory (lEisenstein et all [20071: iNoh et al l 120091 : 



iPadmanabhan et alJl2009f ). INevrinck et al.1 (|2009l ) intro- 
duces a method of logarithmic mapping of the density 
field, successfully recover great amount of in formation 
conten t in the dark matter power spectrum. Weinberg 
( 1992) uses a method called Gaussianization, which is a 
monotonic transformation of the smoothed galaxy dis- 
tribution to reconstruct primordial density fluctuations. 
Here we attempt to Gaussianize the density fields by 
non-linear Wiener filtering. Our objective is to filter out 
the non-linear structure, leaving behind a density field in 
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which the amount of Fisher information has increased. 

Structure formation in current theories starts from an 
initially linear Gaussian random field, which progres- 
sively becomes non-linear through gravitational instabil- 
ities, starting at the smallest scales. This process is local 
in real space, and only the linear part of the field re- 
mains intrinsically Gaussian. In the non-linear regime, 
the Fourier modes start to couple together, and the field 
is no longer Gaussian. It is legitimate to ask how to 
filter out the non-linearities. Wavelet functions provide 
a natural basis that compromises between the locality 
in real and Fourier space. Moreover, it can simultane- 
ously decompose the data, functions or even operators 
in coordinate (or time) space and scale (or frequency) 
space. Therefore, wavelet transforms are an ideal nu- 
merical technique for extracting multi-scale information 
(iFang k Thewsl|l998[K as required by our Wiener filter. 
IPando k Fane! (] 19961 ) applied this method to the high 
rcdshift La syste ms, which often have separations on 
translinear scales. iPenl (|1999() also discussed the applica- 
tion of wavelets to filter the Gaussian noise contained in 
some images, and found that the wavelet basis is highly 
advantageous over the standard Fourier basis if the data 
are intermittent in nature. 

In this paper, we apply the wavelet method in order 
to push the translinear plateau further in the informa- 
tion content, without removing the largest haloes in the 
analysis. The outline of the paper is as follows. In §2, 
we present the TV-body code that was used to simulate 
the dark matter density fields. In §3, we discuss the ap- 
plication of the discrete wavelet transform (DWT) and 
Bayesian theory to filter out some of the non-Gaussian 
component of the density fields. In §4, we compare the 
power spectra of the filtered and unfiltcrcd data, while in 
§5 we analyze the information content in both data sets. 
Discussion and conclusions are presented in §6. 

2. JV-BODY SIMULATIONS OF DARK MATTER DENSITY 

FIELDS 

We use the cosmological simulation code CubeP3M 
(CITA Computing 2008), an upgraded version of PM- 
FAST (IMerz et al.ll2~00l that uses a cubical decomposi- 
tion instead of slabs in the global FFT, and thus scales 
to much larger runs. In addition, the gravitational force 
is now calculated down to the subgrid level, which in- 
creases the code resolution. Finally, it is both MPI and 
openmp parallelized, perfect for running on most com- 
puter clusters. 

We run 141 simulations with a box size of 600 
Mpc and a resolution of 512 3 cells and 256 3 particles. 
The initializ ation step of the simulatio n first reads a 
CMBFAST (jSeljak k Zaldarriagal [19961 ) transfer func- 
tion, which models our fiducial cosmology, and evolves 
the power linearly to an initial redshift of z = 200. It 
then uses the Zel'dovich approximation to calculate a dis- 
placement field and a velocity field, which are assigned to 
the initial particles. The cosmological parameters used 
are M = 0.29, Q A = 0.71, h = 0.73, a 8 = 0.84, and 
n s = 0.95. The code then evolves these initial densities 
up to z = 0. One of those density fields is plotted in 
Figf3]Ja). For the purpose of visualization, the 3D field is 
projected onto a 2D plane by averaging over part of the 
third dimension. 



3. NON-GAUSSIAN FILTERING OF DENSITY FIELDS 

3.1. Discrete wavelet transform (DWT) 

Similar to the Fast Fourier Transform (FFT), the Dis- 
crete Wavelet Transform (DWT) is an invertible, linear 
operation, and can be considered as a rotation in function 
space. Both operations transform a data vector, whose 
length must be an integer power of 2, to a numerically dif- 
ferent vector of the same length. The FFT's basis func- 
tions are sines and cosines, which, of course, are not local- 
ized in real space. The DWT, on the other hand, has two 
kinds of basis functions, called scaling functions (mother 
functions) and difference functions (wavelet functions). 
Unlike sines and cosines, individual DWT basis functions 
are localized both in real and frequency space, and be- 
cause they vary both in scales and locations, they enable 
us to analyze more accurately the fine variations in a 
data set. 

Although there is an infinite number of possible forms 
of DWT basis functions, only a few have a clear mean- 
ing. The simplest discrete wavelet is the Haar wavelet, 
which suffers from the fact that its basis functions are dis- 
continuous and provide a poor approximation to smooth 
functions. We consider in this paper the Daub echies-4 
wavelet basis (hereafter DB-4) (|Daubechieslll992l ). which 
contains families of scaling functions and wavelets that 
are orthogonal, continuous and have compact support. 

We denote the one-dimensional scaling function and 
wavelet function as </> and tp. For simplicity we first con- 
sider a one-dimensional vector d(x) with length L = 2 J . 
Using DB-4 wavelet, it can be expanded as 

1 J-12 J -1 

d{x) = £Q,lfoA x ) + Y Y ej,ii>jA x ), (!) 

1=0 j=0 1=0 

where j and I can be regarded as a dilation and a trans- 
lation of the basis function: 

<t> j ,i(x) = ^cf > (V X /L-l) (2) 

^jAx) = \[^^x/L-1). (3) 

e^i and (ijj are called scaling function coefficients (here- 
after SFCs) and wavelet function coefficients (hereafter 
WFCs). They can be calculated by 

€jj = J d(x)4>jj(x)dx, (4) 

and 

ij t i = J d(x)t/jj t i(x)dx. (5) 

Any particular set of wavelets is specified by a corre- 
sponding set of wavelet filter coefficients. The most lo- 
calized Daubechies wavelet is DB-4 wavelet. Its scaling 
and wavelet functions have 4 coefficients, ao, ...,03 and 
6_i, ...,61 respectively. 

a = (1 + V3)/4, 01 = (3 + V3)/4, 
a 2 = (3 - V3)/4, a 3 = (1 - V3)/4, 



Increasing the Fisher Information by Wavelet 



3 



and 

6- 2 = (1 - \/3)/4, 6_ x = (V3 - 3)/4, 

& = (3 + V3)/4, 6i = (V3-l)/4 

The wavelet transform of a vector of data is done first by 
applying a wavelet coefficient matrix to the whole vector, 
and then applying a "smaller" matrix to the smoothed 
vector of length L/2. For DB-4 wavelet transform, this 
was done until only two SFCs were left. Finally, two 
SFCs store the information of the mean of the largest 
scale of the vector, and L — 2 WFCs store the informa- 
tion of the fluctuations or the differences between regions 
of the vector. For 3D simulation data, the process is sim- 
ilar. The data is wavelet transformed in one arbitrary di- 
rection xi, the result is transformed in another direction 
X2, and then in x%. Finally, the result contains 2 3 = 8 
SFCs and 512 3 - 8 WFCs. 

3.2. Non-linear filtering 

We now have the DWT transform of those density 
fields which we need to filter. Because of the linear- 
ity of wavelet transforms, we can think of the data as 
being filtered into two pieces, the inverse wavelet trans- 
forms of which are called the Gaussianized density d a and 
the non- Gaussianized density df,. The unfiltered density 
fields d consist of the sum of these two parts, and the 
hope is that the Gaussianized density will be closer to lin- 
ear theory. Since the filtering process is in wavelet space, 
we divided the wavelet transform of the density field D 
into the Gaussianized part A and the non-Gaussianized 
part B. Again, because of the linearity of wavelet trans- 
forms, we have 

d = d a + d b (6) 

in real-space and 



D = A + B. 



(7) 



in wavelet space. 

The method of thresholding and Gaussian Wiener fil- 
tering is not suitable in this work, because we need to fil- 
ter localized structure, so we use a non-Gaussian Wiener 
filter instead. In wavelet ba sis the non-Gaussianity is 
clearly represented (|Penlll999[) , so the filter is applied on 
each wavelet modes. Each wavelet mode has three j 's 
to characterize the mode's scale, and we denote wavelet 
mode s with s. W e regard A as Gaussian white noise N, 
as in iPenl (|1999| ) , B as the original image U, and D as 
the noisy image D = A + B. For each wavelet mode, 
we assume that the WFCs A(r, s), at fixed sclae s, has 
the form of a Gaussian distribution Af(0,(J 2 ) over posi- 
tion r, and if we know the PDF of D(r, s), all the modes 
are statistically independent. We denote the PDF for a 
given mode of B(r, s) as 6(6), and we obtain the variable 
D(r,s) with PDF 



' r Q(b)exp(~(d-b) 2 )6b (8) 



f(d) = 



According to Bayes's theorem, we can calculate the con- 
ditional probability P{B\D) = P{D\B)P(B)/P{D). For 
the posterior conditional expectation value, we get 



(B\D = d) = J exp[~(u - d) 2 ]Q(u)udu 



J^—d d Jexp[-±(u-d) 2 }Q(u)du (10) 



B = D 

V2tt/ 

B = D+(\nf)'{d). 
We define two filter functions 

a(s) = A/D, 
p(s)=B/D, 



so 



a(s) = - 



(ln/)'(d) 



/3(s) - 1 + 



(W)'(d) 



(ii) 

(12) 
(13) 

(14) 
(15) 



0) 



In order to realize the filtering of the actual data, we 
have to estimate the prior distribution for each wavelet 
mode. 

We assume that the one-point distribution of modes 
is non-Gaussian and that the modes are statistically in- 
dependent, and so we need to specify the PDF for each 
mode. In th is work we use Cartesian product wavelets 
(|Meverlll995D . This basis for our density fields has three 
scales 2 J ~^, 2 J ~^ and 2 J ~i' d (in unit of pixels), which 
are the length, width and height of the cuboid each 
wavelet support in real-space. We define the volume of 
the cuboid in real-space as the equivalent volume V e . For 
the real data, the structures dealt with have different 
physical scales and specific locations in the 3D density 
field. Consequently, the optimal basis must also depend 
both on the scale and the location. As mentioned before, 
our wavelet basis, which is localized both in frequency 
space and in real-space, are perfectly well suited to ef- 
fectively extracting the information of these structures. 
Every WFC can be seen as the dot product of the density 
field and a wavelet of a particular scale and shape. 

Since the modes are statistically independent, it is nat- 
ural to collect together all the WFCs with the same cor- 
responding cuboids, and then to obtain the PDF of this 
scale. Actually, we do not differentiate the direction of 
the cuboids and consider only the combination of ji , j2 
and j 3 . That is to say, we regard (ji,j2,h) = (1,2,3) 
and (ji,ju,ja) — (1,3,2) as the same cuboids, and the 
corresponding WFCs are collected together. Note that 
there may be more than one combination whose equiva- 
lent volumes are the same. 

However, fewer WFCs are available for larger scales, or 
greater equivalent volumes, and thus the PDF has lower 
statistics, which makes the calculation of filter functions 
less accurate. If there are not enough WFCs, or if im- 
proper statistics are used, the appearance of fluctuation 
in the PDF, especially the region near zero where the 
property of Gaussianity is detected, reduces our preci- 
sion in the determination of the filter functions. How- 
ever, since we collect the WFCs over 141 density fields 
we have enough statistics to make both the PDF and 
filter functions smooth. Also, in counting the WFCs, 
we choose an adequate set of intervals, or bins, in order 
to best represent the shape of the PDF. The range of 
the intervals must both be large enough to contain the 
non-Gaussian 'tails' in the PDF, and small enough to 
show the detailed properties of Gaussianity. We discuss 



4 



Zhang et al. 




15 20 

Value of WFCs 

Fig. 1. — Filter functions a and /3 as a function of the WFCs 
value, 9 wavelet modes are chosen. Each mode is described by a 
combination of three lengths, as seen in the legend, which form the 
cuboid that the DWT basis function governs. Moving from left to 
right in the plot corresponds to moving from top to bottom in the 
legend. As WFCs are small, one can see that a is simply unity, 
where Gaussianity dominates. 

these in the next paragraph. We find that the WFCs on 
different scales have different distributions. The curves 
of PDFs where WFCs value are relatively small appear 
to be Gaussian, but there are non-Gaussian 'tails' for 
larger values of PDFs. It seems that if those WFCs be- 
long to larger scales, or larger V e , the 'boundary' value 
between Gaussianity and non-Gaussianity is larger. We 
plot 9 filter functions in FigfT] to compare the shape of 
the filter function curve between scales. This property is 
more easily visualized if we define a critical value on each 
scale, defined as the critical value of WFC x c that yields 
a = 0.9. In FigH]we plot that critical value x c versus V e . 

The CDF of a set of data is obtained by sorting and by 
doing derivatives to get the PDF. However, it is challeng- 
ing to sort all the WFCs on one scale among hundreds 
of density fields. Instead, we first construct several hun- 
dreds of interval bins, and then read every density field 
datum to check the WFCs on each scale, and find which 
interval each of these WFCs belongs to. Actually, there 
are a total 120 different scales, so we need to count all 
the WFCs on each scale and on each of the 141 densities 
to produce our final 120 PDFs of WFCs. Then, we check 
each of these PDFs, and determine whether the intervals 
properly represent the shape of PDFs. If not, we may 
try another series of bins, get a new set of PDFs, and 
iterate until all the PDFs have been measured. 

We also use i) the same set of simulations but with 
lower resolution, and ii) new set of simulations with finer 
resolution, and get similar PDFs of WFCs (thus filter 
functions) on same physical scales. As a result, the res- 
olution of simulations does not affect WFCs' PDF if the 
physical scales are the same. 

From the relationship between the critical values and 
V e , we can see that for smaller scales, the critical values 
are smaller, such that the signal contributes more to the 
non-Gaussian density field, as expected. 

We finally filter all the data by multiplying the den- 
sities in wavelet space with our filter functions, Eqfl4] 



Critical value x c I a „ 90 of WFCs thai separate 
Gaussianity and non-Gaussianity in each mode 



Volume of each mode V in unit of Mpc 

Fig. 2. — Critical value of WFC x c of the scales versus the 
equivalent volume V e of the basis function. The x c is defined via 
a(x c ) = 0.9, which can be regard as a boundary between Gaus- 
sianity and non-Gaussianity on that scale. 

and Eqfl5l which separates those 141 density fields into 
Gaussian and non-Gaussian contributions. Note that we 
do not filter the SFCs because they only govern the mean 
of the largest scales and have no significant impact on the 
power spectra. We plot a slice of an unfiltered density 
and its Gaussianized and non-Gaussianized components 
in FigES 

4. COMPARISON OF POWER SPECTRA 

The power spectrum is the Fourier transform of the 
correlation function and measures the amount of cluster- 
ing in the matter distribution. It is measured in terms 
of the wavenumber k in units of hMpc^ 1 . In the case of 
a Gaussian random field, the power spectrum describes 
completely the statistics of the distribution, as all high 
moments either vanish or factor into its explicit func- 
tions. The power spectrum from each simulation is de- 
fined as 



(|r5(M')|> 2 = (2ir) 3 P(k)6(k-k>), 



(16) 



where (\5(k, k')\) 2 is the volume average of the expecta- 
tion value and P(k) is the power spectrum. Of equal 
interest is A|, the power spectrum in its dimensionless 
form, defined as: 



At 



k 3 P(k) 

27T 2 ' 



(17) 



For the 141 simulations, Gaussianized and non- 
Gaussianized densities are generated, as described in 
the previous section. The power spectra of these 423 
mass distributions are calculated using the 'Nearest Grid 
Point' (NGP) mass assignment scheme, which calculates 
the position of each particle based on which grid point 
it is nearest. In FigH]we plot the mean power spectrum 
and error bars of 141 unfiltered, Gaussianized and non- 
Gaussianized density fields. We can see that for small k, 
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Fig. 3. — (a) Map of a density field with a width of 600 h~ 1 Mpc and 512 3 pixels, randomly selected from 141 Af-body simulations. A 
slice of the density field is projected onto the x-y plane for visualization, (b) Same as (a), but the map is Gaussianized by wavelet filtering, 
(c) Same as (a), but the map is non-Gaussianized by wavelet filtering, (b) and (c) add up to (a). 

ance, or the error bars, of the power spectra at different 
scales k. If the measurements were completely uncor- 
related, the diagonal of the covariance matrix would be 
the variance at each value of k, and all the off-diagonal 
entries would be zero. Mathematically, the covariance 
matrix is defined as 



- Uiitlltei'cd 

- Gaussianized 

- Non-Guussianizcd 

- Linear 




Wavenumber k [h Mpc 1 ] 

Fig. 4. — Power spectra and error bars of the density field simula- 
tions, and their Gaussianized and non-Gaussianized power spectra. 
Each curve is determined by the mean of over 141 power spectrum 
samples, and the error bars describe those standard deviations. 
The error was multiplied by 3. The BBKS linear power spectrum 
is plotted in dash line. 

corresponding to large scales, Gaussianized fluctuations 
dominate the power, and that for large k, correspond- 
ing to small scales, non-Gaussianized fluctuations domi- 
nate. Non-Gaussianized fluctuations begin to dominate 
the power in the region k ~ 0.2, which is the boundary 
of the linear power spectrum and the non-linear power 
spectrum. We can see that the non-Gaussian filter in 
wavelet space is quite efficient at separating the linear 
and non-linear power spectra. 

5. INFORMATION CONTENT IN THE POWER SPECTRA 
OF DENSITY FIELDS 

The covariance is the mean value of the product of the 
deviations of two variables from their respective means. 
Simply put, it measures the correlation between the vari- 



es, fc') = 



N 



1 N 

— Y,[Pi(k) ~ (P(k))][Pi(k') - (P(k'))}, 



(18) 

where N is the number of realizations and (P(k)) is the 
mean power spectrum. 

The cross-correlation coefficient matrix, or for short 
the correlation matrix, is a normalized version of the 
covariance matrix, where each value is divided by the 
square root of the diagonal values as follows: 



p(k,k') = 



C{k,k') 



y /C(k,k)C(k',k') 



(19) 



The three correlation matrices for our unfiltered, Gaus- 
sianized and non-Gaussianized densities are shown in 
FigjSJ For the covariance matrix of the unfiltered den- 
sities, the linear regime is diagonal, while in the non- 
linear regime, the power spectra at different fc-modes are 
correlated by at least 40%. In the Gaussian filtered co- 
variance matrix, however, we find a smaller correlated 
region. It seems that the correlations between fc-modes 
are suppressed to some extent. When we look at the 
non- Gaussian filter, we see that the diagonal is now a 
few cells thick, i.e. some non-diagonal elements are cor- 
related even in the linear regime. This is allowed, since 
the 'Gaussian + non-Gaussian' decomposition is done 
on the densities, not on the covariance matrix. Also, the 
corresponding value in the covariance matrix are many 
orders of magnitude smaller than those in the unfiltered 
and Gaussian cases, and thus play no role what so-ever. 

The cumulative Fisher information is defined as fol- 
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(a) (b) (c) 




Wavenumber k [h Mpc 1 ] Wavenumber k [h Mpc x ] Wavenumber k [h Mpc 1 ] 



Fig. 5. — (a) Cross-correlation coefficient matrix as found from 141 power spectra of the density field simulations. The squares in black 
on the diagonal line indicate perfect correlation, (b) Same as (a), but the power spectra are from the Gaussianized density fields, (c) 
Same as (a), but the power spectra are from the non-Gaussianized density fields. 



lows: for a given wavenumber, we select a subsection of 
the covariance matrix to that wavenumber, and we in- 
vert this sub-matrix, and sum over all its elements. The 
normalized covariance matrix is described as follows 



Cnorm(fcj k ) 



C(M') 



(P(fc))(p(fc')>' 

and our cumulative information function is 



I(k n ) 



1 

norm 



(fcj , kj 



(20) 



(21) 



We plot the cumulative information of the unhltcrcd, 
Gaussianized and non-Gaussianized power spectra in 
Fig(6] One can see that in the translinear regime, where 
k ~ 0.2, the cumulative information of the density fields 
has a flat plateau, which indicates that there is nearly 
no independent information in the non-linear regime of 
the power spectrum. In comparison, for the Gaussian- 
ized information, the cumulative information also has 
a plateau, but at a higher value, which indicates that 
it contains about three times more independent Fourier 
modes. This is a key result, especially for parameter es- 
timation obtained through Fisher matrices. These anal- 
yses could be improved by up to a factor of \/i solely by 
filtering their data. 

6. CONCLUSION 

We use the code 'CubeP3M' to generate 141 dark mat- 
ter density fields. In wavelet space, we construct a non- 
Gaussian Wiener filter in order to separate non-Gaussian 
and Gaussian structures. We measure the power spec- 
tra of the unfiltcrcd, Gaussianized and non-Gaussianized 
density fields, and find that the Gaussianized power spec- 
trum is heavily dampened on small scales, while the 
non-Gaussian features dominate at smaller scales, as ex- 
pected from structure formation. We also calculate the 



I 10 ' 
u 




n' 



— ■ — Unfiltered 
n Gaussianized 
o non-Gaussianized 
Lineal" 



Wavenumber k [h Mpc ] 

Fig. 6. — Cumulative information in the power spectra as a func- 
tion of the wavenumber. The black squares filled with grey corre- 
spond to the density field simulations without filtering. The blue 
squares and the blue solid line correspond to the Gaussianized den- 
sities. The red squares and the red dashed line correspond to the 
non-Gaussianized densities. 

cumulative information, and find that the plateau in the 
translinear regime rises in the Gaussianized information 
curve by a factor of three. 
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